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ABSTRACT 

implicit-function  theorem  is  established  for  a multifunction 
consisting  of  the  sum  of  a differentiable  function  and  a maximal  monotone 
operator.  Applications  to  nonlinear  complementarity  problems, 
mathematical  programming  problems,  and  economic  equilibria  are  pointed 
out.  An  application  to  the  analysis  of  a general  Newton  method  for 
solving  variational  inequalities  is  treated  in  some  detail. 
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AN  IMF  LICIT- FUNCTION  THEOREM  FOR  GENERALIZED 


VARIATIONAL  INEQUALITIES 

$ 

Stephen  M.  Robinson 

1 • Introduction.  In  this  paper  we  shall  study  the  behavior  of 
solutions  of  the  inclusion 

0 c f(x)  + T(x)  , (1.1) 

where  f is  a continuously  Frechet  differentiable  function  from  an  open 
set  N IR  into  IR  and  T is  a maximal  monotone  operator  from 
IR  into  itself  (recall  that  an  operator  T is  monotone  if  for  each 
(x|.w|)»  (X2,W2^  in  graph  T one  has 

<x1  - x2,  Wl  - w2>  > 0 , 

where  (• , • ' denotes  the  inner  product,  and  maximal  monotone  if  its 
graph  is  not  properly  contained  in  that  of  any  other  monotone  operator). 

We  shall  be  particularly  interested  in  conditions  which,  when  imposed 
on  f and  i,  will  ensure  that  the  set  of  solutions  to  (1.1)  remains 
nonempty  and  is  well  behaved  (in  a sense  to  be  defined)  when  f is 
subjected  to  small  perturbations.  To  introduce  these  perturbations,  we 
shall  make  use  of  a topological  space  P and  a function  f : P x N - IRn, 
so  that  we  can  replace  (1.1)  by 

0 e f(p,x)  + T(x)  , (1.2) 
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and  study  the  set  of  x which  solve  (1.  2)  us  p varies  near  a base 


value  pQ. 

A particular  case  of  (1.2)  of  special  interest  for  applications  is  that 

in  which  T is  taken  to  be  the  operator  where  for  a closed  convex 

set  C IRn  one  defines  the  indicator  function  4^  of  C by 

f 0,  x e C 
Ux)  :=  < 

1^+°°,  x ^ C , 

and  where  d denotes  the  subdifferential  operator  [16,  §23).  This  yields 
the  so-called  variational  inequality 

0 e f(p,  x)  + 8^c(x)  , (1.3) 

which  expresses  analytically  the  geometric  idea  that  f(p,  x)  is  an 
inward  normal  to  C at  x. 

Many  problems  from  mathematical  programming,  complementarity, 
mathematical  economics  and  other  fields  can  be  represented  in  the  form  (1.  3): 
for  example,  the  nonlinear  complementarity  problem 

F(x)  e K* 

x € K (1.4) 

(x,  F(x))  = 0 

where  F : IRn  - IRn,  K is  a nonempty  polyhedral  convex  cone  in  IRn, 
and  K :=  {y  € IRn  I < y,  k)  > 0 Vk  t K),  can  be  written  as 


Further  information  on  nonlinear  complementarity  problems  (often  with 
K IRn,  the  non-negative  orthant)  may  be  found  in,  e.g.,  [ 2),  [3],  [8], 

| 9).  The  Kuhn-Tucker  necessary  conditions  for  mathematical  programming 
[ 7 | form  a special  case  of  (1. 4);  e. g. , for  the  problem 

minimize  9(y) 

subject  to  g(y)  < 0 (1.  5) 

h(y)  = 0 

m q 

where  0,  g and  h are  differentiable  function  from  IR  into  IR,  IR 
and  IRr  respectively,  one  has  the  Kuhn-Tucker  conditions 

0'(y)  + ug'(y)  + vh'(y)  = 0 

g(y)  < 0 
h(y)  = 0 
u > 0 

<u,  g(y)>  = o , 


and  these  can  be  written  in  the  form  (1.4)  by  taking  n = m + q + r, 
K = IRm  x 1R^  x IRr,  X = (y,  u,  v)  and 


F(x) 


[ 0'(y)  + ug'(y)  + vh'(y)J 

-g(y) 


L -My)  J 

There  are  also  important  applications  of  (1.  3)  to  economic  equilibrium 
problems  [ S),  [18),  [19),  among  others.  It  is  of  interest  to  note  that 
in  most  of  the  applications  mentioned  one  finds  that  C is  a polyhedral 
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convex  set,  and  we  shall  see  that  particularly  strong  results  can  be 
obtained  for  such  problems. 

The  organization  of  this  paper  is  as  follows:  in  the  next  section 
we  shall  state  an  implicit-function  theorem  (Theorem  1)  for  systems  of 
the  form  (1.2);  we  then  specialize  this  result  to  the  case  in  which  T 
has  a certain  polyhedrality  property  and  obtain  a form  (Theorem  2) 
which  is  useful  in  applications . In  the  remainder  of  Section  2 we  shall 
give  the  proof  of  Theorem  1.  In  Section  3 we  shall  prove  Theorem  2, 
after  having  established  some  necessary  results  about  polyhedral  sets 
and  functions.  Finally,  in  Section  4 we  shall  apply  these  results  to 
prove  implementability  and  quadratic  convergence  of  a general  Newton 
algorithm  for  computational  solution  of  variational  inequalities. 

2.  Main  results.  Before  stating  the  main  theorem,  we  require  a 
preliminary  definition  dealing  with  a certain  continuity  property  of  multi- 
valued functions  (or  multifunctions,  as  we  shall  call  them). 

DEFINITION:  Let  X and  Y be  normed  linear  spaces . A multi- 
function F : X -Y  is  upper  Lipschitzian  with  modulus  \,  or  U.L.(\), 
at  a point  xQ  c X with  respect  to  a set  V C X,  if  for  each  v e V one  has 

F(v)  C F(xQ)  + X II v - xQ  II  By  , 

where  By  is  the  unit  ball  in  Y.  We  say  F is  locally  U.L.(\)  at  xQ  if  it 
if  U.L.(X)  at  xQ  with  respect  to  some  neighborhood  of  x^. 
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This  property  is  close  to  the  Lipschitz  continuity  for  multifunctions 
define  i by  Rockefeller  | 17,  §3),  except  that  we  do  not  require  F(Xq) 
to  ue  e singleton;  in  the  problems  we  shall  consider  F(x^)  will  often 
be  multivalued. 


Our  principal  result  is  the  following  theorem,  in  which  we  use  f 
* 

to  denote  the  partial  Frechet  derivative,  with  respect  to  the  second 
argument,  of  e function  of  two  variables;  B denotes  the  unit  ball  in  lRn 
with  respect  to  the  Euclidean  norm,  which  is  used  throughout  the  remainder 
of  the  pap-r. 


THEOREM  1:  Let  P be  a topological  space,  N an  open  set  in  JRn 
and  T a maximal  monotone  operator  from  IRn  into  itself.  Let  t be 
a continuous  function  from  P x N into  lRn  such  that  f is  continuous 
on  P x N . Let  pQ  <=  P;  suppose  that  there  are  a nonempty,  bounded 
subset  XQ  of  N and  numbers  \ and  p > 0 such  that  for  each  xQ  e XQ 

i)  (Lf  + T)  * is  U.L.(\)  eit  0 with  respect  to  r|B,  where 
X0 


Lfx0('):=  f(p0»X0)  + f2(P0’X0)[(,)  'X0J  ’ 

ii)  (Lf  + T)_1(0)  = X and 
X0  ° 

iii)  f^( Pq’  xq)  is  positive  semidefinite . 

Then  there  exist  a number  6 > 0 and  a neighborhood  U(pQ)  such 


that  with 


£(p)  : = 


{x  € XQ  + 6B  |0  € f(p,  x)  + T(x) },  p t U 


0, 


Pi  u 
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iff '<A 


one  has: 

1)  £ is  upper  semicontinuous  from  U to  IRn, 

2)  S(P„)  - X0, 

and 

3)  For  each  e > 0,  for  some  neighborhood  U (p„)  and  for  each  p L . 

e o e 

<t>  * £(p)  C £(pq)  + (\  + e)aQ(p)B  , 

where 

aQ(p)  :=  max{  ||f(p,  x)  - f(pQ,  x)  II  lx  € XQ}  . 

Note  that  if  P is  actually  a normed  linear  space  and  if  f(p,  x) 

is  Lipschitzian  in  p uniformly  over  x€  XQ,  then  for  some  constant  p 

and  each  p e U we  have 
e 

£(p)  C Z(pQ)  + (X  + e)p  Jlp  - PQ  II  B, 

so  that  Z is  locally  U.L.  [ (\  + e)p]  at  pQ. 

We  next  state  a specialization  of  Theorem  1 to  the  case  in  which  T 
has  a certain  additional  property,  described  in  the  following  definition. 

DEFINITION  2:  A multifunction  0 : IRn  -*•  F™  is  polyhedral  if  its 
graph  is  the  union  of  a finite  (possibly  empty)  collection  of  polyhedral 
convex  sets  (called  components). 

Here  we  use  "polyhedral  convex  set"  as  in  [16,  §19]. 

THEOREM  2:  Let  P and  N be  as  in  Theorem  1.  and  let  f : P x N — ]Rr‘ 
with  f and  f^  continuous  on  P x N.  Let  T be  a polyhedral,  maximal 
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monotone  operator  from  IRn  into  ]Rn,  and  suppose  that  there  are  an 
n x n positive  semidefinite  matrix  A and  a point  a e IRn  such  that  with 

XQ  :=  {x  € ]Rn|0  r Ax  + a + T(x)} 

one  has 

a.  XQ  is  a nonempty,  bounded  subset  of  N,  and 

b.  For  each  x e X Lf  ( • ) e A(  ■ ) + a. 

u u xQ 

Then  the  conclusions  of  Theorem  1 hold;  the  number  \ can  be  take:,  to  be 
a local  upper  Lipschitz  modulus  at  0 for  [ A(  • ) + a + T(  ■ ) ] \ and 
this  number  exists  without  further  assumptions. 

We  shall  prove  in  Section  3 that  if  <p  is  a polyhedral  convex 
function  on  IRn  (i.e.,  one  whose  epigraph  is  a polyhedral  convex  set) 
which  never  takes  the  value  -»,  then  dip  is  a polyhedral  multifunction 
in  the  sense  of  Definition  2.  In  this  case  it  is  well  known  that  d<p  is 
also  maximal  monotone  [16,  Cor.  31.  5.2],  so  the  result  in  Theorem  2 
is  valid  for  T = dip.  In  particular,  Theorem  2 may  be  applied  to  the 
variational  inequality  (1.  3),  for  the  case  in  which  C is  a polyhedral 
convex  set,  by  taking  ip  = ip  . One  thus  obtains  an  implicit-function 
theorem  for  (1.  3);  by  specializing  further  to  the  case  P : = IR  , pQ  :=  0, 
and  f(p,  x)  : - f(x)  - p,  one  obtains  an  "inverse-function"  theorem 
dealing  with  inclusions  of  the  type 

p r f(x)  + ao-c(x), 

for  p near  0.  This  result,  incidentally,  provides  a convenient  tool 
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for  verifying  that  the  operator  (f  + °vq)  ^ is  'Lipschitz  continuous  at  0,  " 

in  the  sense  required  in  [ 17,  §3). 

PROOF  OF  THEOREM  1:  Denote  by  dom  G the  effective  domain 

of  a multifunction  G:i.e.,  the  set  {xiG(x)t^}.  Choose  any  xq  " 

by  (in).  f jp  , x ) Is  positive  semidefinite,  so  Lf  is  a maximal 
ZOO  xQ 

monotone  operator.  As  T is  also  maximal  monotone  (hence  nonempty) 
and  as  the  effective  domain  of  Lf  is  all  of  1R  , we  have  from  [IS] 

xo 

that  the  operator  Q(x  ) :=  Lf  + T is  maximal  monotone;  hence  so  is 

_iX° 

its  inverse.  By  (ii),  Q(xq)  (0)  = XQ,  a bounded  set,  and  as  by  (i) 

Q(x0)_1  is  locally  U.L.(\)  at  0,  we  find  that  it  is  locally  bounded 
there;  in  fact,  it  must  be  locally  bounded  at  every  point  of  int  nB,  since 
the  image  of  some  ball  around  such  a point  will  be  contained  in  the  image 
of  nB,  which  in  turn  is  contained  in  the  bounded  set  XQ  + \nB.  But 
then  from  [14,  Th.  1)  we  have  that  int  nB  cannot  contain  any  boundary 
point  of  dom  Q(xq)  S however,  as  int  nB  meets  dom  Q(xq)  (at  °) 
and  is  connected  we  finally  conclude  that  int  nB  C int  dom  Q(Xq)  • Thus, 
for  each  y with  II y II  < n the  set  0(xQ)  \y)  is  nonempty  and  bounded; 
it  is  also  closed  and  convex  by  maximal  monotonicity,  hence  compact. 

In  particular.  X()  is  a compact  convex  set. 

Define,  for  two  subsets  A and  C of  P and  a point 
x . IRn,  d[  x,  C|  inf 3 ||x  - c II  [c  t C}  and  d[  A,  C]  = sup{d[  a,  C]  la  £ A), 
where  the  supremum  and  infimum  of  <t>  are  defined  to  be  -0°  and  +<*> 
respectively.  Denote  by  it  the  projection  from  R onto  X^;  tt  is 
well  known  to  be  nonexpansive,  hence  a fortiori  continuous. 


-8- 


r 


Usmy  continuity  and  compactness,  one  can  show  that  the  function 
I*6)  : max!  ilf2(P0,  x)  - f2(pQ,  tt( x) ) II  ix  f XQ  + 6 B } 

is  well  defined  for  small  6,  and  is  continuous  at  0 with  (3(0)  = 0. 

Thus,  we  can  choose  a positive  6 small  enough  that  the  set  X.  :=  X + 6B 

o 0 

is  contained  in  N,  and  such  that  \|3(6)  < j and  6 p( 6)  < jn.  It  is  not 
difficult  to  show  that  for  this  fixed  6 the  function 

«6(p)  J=  max',  llf(p,  x)  - f(pQ,  x)  ||  |>:  e X& } 

is  well  defined  for  all  p e P,  and  is  continuous  at  p_  with  a.(p  1 = 0. 

0 6 0 

Thus,  we  can  choose  a neighborhood  L(p  ) such  that  for  each  p e U, 
a&(P)  < ^ h anci  ^a&(P)  5 ^ &.  Now  choose  any  p € U,  and  define  a 
multifunction  F from  X,  into  IRn  by 

Fpi.x)  :=  Q(ir(x))  J[Lfw(x)(x)-  f(p,x)J  . 

If  x is  any  point  of  X , we  have 

‘lLfTr(x)(x)  * f(P»x)H  < IU(P,x)  - f(pQ,  x)  ||  + ilf(p0,x)  - Lfff/x)(x)||  . (2.1) 

Now  define  (for  this  fixed  x)  a function  of  one  real  variable  t by 
g(T)  :=  f(PQ,  tx  + (1  - t)tt(x))  - Lf^,(Tx  + (1  - t)tt(x))  . 

We  find  that 

iif(pQ,  x)  - Lf^xj(x)  II  = ||g(l)  - g(0)  ||  < sup{  ilg'(T)  ||  |o  < t < 1}  . 
However,  for  t t (0,1], 

g'(T)  = [ f2(P0.xx)  ' f2(P0>  tt(x>)  I ( x - ir(x)J  , 


■ 


where  xt  : - Tx  + (1  - t)tt-(x,).  We  have  by  properties  of  the  projection 
that  tt(xt)  = n(x),  so 

^yP0’XT*  ~ f2(P0’  *(x^l  = fif2(p0’ XT>  " ^’Po' 1 * 

since  x e Xf . Hence 
t i 

llf(p0,x)  - Lf^ .(x)  I!  < p(6)  (| X - it(x)  ||  . (£.1) 

As  II  f(p,  x)  - f(pQ,  x)  II  < Qf(p),  we  have  from  (2.1)  and  (2.2) 

^Utt(x)(x)  " f(p’x)  ® = °'6(P)  + _ (2.  3) 

< j n + j n = r,  . 

Hence,  by  our  previous  remarks  Fp(x)  is  a nonempty  compact  convex  set 
for  each  x € X . Also,  using  (i),  (ii)  and  (2.3)  we  have  for  x € X. 

U ft 

dfFp(x)-X0l  = d[  Q(tt(x))_1[  Lf7r(x)(x)  - f(p,  x)  |,  Q(^(x))'1(0)] 

= X ^LfT r(x)(X^  " f^P’  X^  ^ (2-  4) 

< \ab(  p)  + \(3(f)  ||  x - it(x)  II  5 2 6 + 2 L = 6 ’ 

so  F carries  X,  into  itself.  We  have 
P o 

graph  Fp  * {(x,  y)  lli^x)  - f(p,x)  e Lf^y)  + T(y)} 

= ((x»  y)lo  € f(p,  x)  + f2(pQ,ir(x))(y  - x)  + T( y) } . 

Using  the  continuity  of  f,  f,,  and  tt,  together  with  the  fact  that  T 

is  closed  (by  maximal  monotonicity),  one  can  show  without  difficulty 

that  graph  Fp  is  closed  in  X&  xXft.  We  can  thus  apply  the  Kakutani 

fixed-point  theorem  to  conclude  that  there  is  some  x e X,  with 

P £ 

x e F (x  );  that  is, 

P P P 
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so 


Lf  . x ) - f(p,x  ) € Lf  , ,(x  ) + T(x  ) , 

n x p'  p'  ir  x ) p p'  ’ 

P P 


0 € f(p,  xp)  + T(x  ) 

and  thus  x c 2(p),  which  is  therefore  nonempty.  We  have 
P 

graph  2 = {(p,  x)t  U xxjoe  f(p,  x)  + T(x)}  ; 

this  is  closed  in  U xX§  by  joint  continuity  of  f and  closure  of  T. 

However,  the  range  of  2 is  contained  in  the  compact  set  X.;  thus 

0 

by  [10,  Lemma  4.4]  2 is  actually  upper  semicontinuous  from  U to  Xc  . 

6 

If  X0€X0  then  by  (ii)  one  has  0 e Lfx  (xQ)  + T(xQ)  = f(pQ,  xQ)  + T(xQ), 
so  Xg  e 2(Pq)  and  thus  2(p^)  On  the  other  hand,  if  x e 2(p^) 

then  xtX{  and  0 c f(p  , x)  + T(x);  therefore 

Lfw(x)(x) -f((,0'x,£Lf1,(x)(x)  + T(x) 

so  that  x e F (x).  As  xt  Xt,  we  have  from  (2.4)  with  p = p that 

Pg  0 o 

d[x,XQl  < d[Fp  (x),  XQ  ] < \||Lfw(x)(x)  - f(p0,  x)  ||  . 


But  from  (2.  3)  with  p = p we  find  that 

^L£tt(x)(x)  " f(P0’x)l1  = " "(X)H  = P(6)d[x,X0]  . 

Thus 

dKXgl  = ^)d[x,XQl  < |d[x,X0]  , 

implying  that  xeXQ  since  XQ  is  closed.  Thus  we  actually  have  2(pQ)  = XQ. 

Now  take  any  e > 0;  find  6 e(0,6]  such  that  for  ae[0,6  ] 
one  has  \(3(<r)  < e/(\  + e ).  One  can  show  that  the  function 
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\(p)  : max'  f_(p,  x)  - f ,(p„,  x)  ||  ; x ► + 6 B}  is  well  defined  on  P 

c c o o e 

and  is  continuous  at  p„:  choose  a neighborhood  U (p.)  C U so 

U E U 

that  if  p - U£  we  have  £(p)  Z(pQ)  + and  ^ViP)  5 "J  e/(X  + E)- 

Now  choose  any  p e and  any  x € £<p).  Using  (2.4)  and  the  fact 

that  x t F (x),  we  have 
P 

d[x,Z(p0)l  < d(Fp(x),XQJ 

< X li  Lf  , »(x)  - f(p,  x)  || 

< \ ||h(x)  - h(ir(x))  ||  + x|[h(ir(x))  || 

+ X ||f(p0,  x)  - Lfw(xj(x)  II  , 

where  h(x)  :=  f(p,x)  - f(pQ,x).  If  we  define,  as  before,  xt  :=  tx  + (1 
we  have 

llh(x)  - h(ir(x))  ||  < ||x  - tt ( x)  || sup { l|h'(xT)  ||  |o  < t < 1}  . 

But  h'(xT)  = f2(p,  xT)  - f2(pQ,xT),  so 

llh(x)  - h(iT(x))li  < \(p)  II x - n(x)  ||  . 

Thus,  using  (2.2),  (2.  5)  and  the  fact  that  |lh(tr(x))||  < aQ(p),  we  have 
d[  x,  Z(pQ)  J < \ y(p)  II x - ir(x)  ||  + X3Q(p)  + X ftf^)  II x - tt(x)  || 

< [e/(X  + e)|  II x - ir(x)  ||  + XaQ(p)  . 

But  ii  x — ?r(  x)  il  = d[  x,  Z(pQ)],  so  if  X >0  we  obtain 

|V/(X  + e)  J d[  x,  2J(pQ)  | < XaQ(p) 

and  thus 

d[  x,  2J(pq)  1 < (X  + e)aQ(p)  . 
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(2.5) 

TM*), 


(2.6) 


On  the  other  hand,  if  \ 0 then  (2.S)  implies  that  d[  x,  ( Pq ) J = 0, 

in  which  case  (2.6)  holds  trivially.  In  either  case,  therefore, 

Z(p)  S(PQ)  + (\  + e)oQ(p)B  , 

which  completes  the  proof. 

In  order  to  apply  Theorem  1,  one  has  to  be  able  to  verify  the 
uniform  upper  Lipschitz  continuity  of  the  multifunctions  QfXg)  * for  x^  e X . 
As  this  may  not  always  be  easy  to  do,  it  is  of  interest  to  avoid  this 
requirement  uy  employing  Theorem  2 in  those  cases  to  which  it  is  applic- 
able. We  shall  therefore  proceed  to  establish,  in  the  next  section, 
results  about  polyhedral  multifunctions  which  will  allow  us  to  prove  Theorem  2. 

3.  Polyhedral  multifunctions.  We  have  already  defined  the  class 
of  polyhedral  multifunctions,  and  it  is  clear  from  the  definition  that  such 
a multifunction  is  always  closed.  Our  goal  here  is  to  show  that  it  is  in 
fact  locally  upper  Lipschitzian  at  each  point  with  a uniform  modulus;  to 
do  this  we  require  several  lemmas. 

LEMMA  1:  Let  P : IRn  -*  IRm  be  a nonempty  polyhedral  multifunction 

and  let  G.,  1 < i < k,  be  its  components-  Let  xQ  € dom  P,  and 

define  fi  !x_  e -n  (G.) }.  where  tt  is  the  canonical  projection  of 

0 x i x 

lPn  x IRm  on  IRn.  Then  there  exists  a neighborhood  U(xQ)  such  that 

(U  X lRm)  fl  graph  P C U G.  . 

i € J 1 

PROOT:  Tor  each  i,  both  (xQ}x  IR  and  G^  are  nonempty 
polyhedral  convex  sets  in  Ftn  m.  If  i ^ .9  they  do  not  meet,  and  thus 
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\ 


I 

Mm. 


by  | 16,  Cor.  19.  3.  3]  they  can  be  strongly  separated.  In  particular,  then, 
there  is  some  neighborhood  U.(x)  such  that  (U^  x IRm)  fl  G,  - j . Let 
U ri  U.;  this  U is  a neighborhood  of  xQ  because  the  number  of 

i 4 J 1 

components  is  finite.  Clearly, 

k 

(U  X IRm)  D graph  PC(U  G.)\(  U G.)  - U G , 

1-1  ' i^.J  ' it  J ' 

and  this  completes  the  proof. 

Lemma  1 tells  us  that  if  x e U and  y € P(x),  then  (x,  y)  belongs 

to  a component  G.  which  also  contains  (xQ,  yQ)  for  some  yQ  e P(xQ). 

In  addition  to  this  information,  wo  also  need  an  extension  of  a basic 

theorem  of  Hoffman  [ 4]  about  linear  inequalities. 

LEMMA  2.  (Generalized  Hoffman  Theorem):  Let  G be  a nonempty 

polyhedral  convex  set  in  IRn  x ]Rm.  Let  tt  and  it  be  the  canonical 

x y 

projections  of  IRn  x ]Rm  onto  ]Rn  and  lRm.  For  z :=  (x,  y)  c tt  (G)  x r (G) 

x y 

define 

dx(z,  G)  :=  min{  II x - x II  |(x,  y)  ■-  G} 

and 

d (z,  G)  :=  mln{  lly  - y||  |(x,  y)  G)  . 

Then  there  exist  6.  n such  that  for  each  z e tt  ( G)  x n (G) 

x y\ 

d (z,G)  < nd  ( z,  G);  d (z,G)  < id  (z,  G)  . 
a - y y - x 

PROOF:  As  G is  polyhedral,  we  can  represent  it  in  the  form 

G = {(x,  y)  e IRn  x !Rm  | Ax  + By  < c)  , 
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so 


Lf  ,(x  ) - f(p,  x ) € Lf  , ,(x  ) + T(x  ) , 

t r(x  )'  p'  ’ p tt( X ) p p ’ 

P P 


0 € f(p,  x ) + T(x  ) 
P P 


and  thus  x^  > Z(p),  which  is  therefore  nonempty.  We  have 
graph  Z = {(p,  x)  e Uxxjoe  f(p,  x)  + T(x)}  ; 


this  is  closed  in  U XX,  by  joint  continuity  of  f and  closure  of  T. 

O 

However,  the  range  of  Z is  contained  in  the  compact  set  Xf ; thus 
by  [10,  Lemma  4.4]  Z is  actually  upper  semicontinuous  from  U to  Xr  . 
If  xQ  e XQ  then  by  (ii)  one  has  0 e Lf^  (xQ)  + T(xQ)  = f(PQ»  XQ)  + T(xQ), 
so  xQ  € Z(P0)  and  thus  Z(pQ)  O • On  the  other  hand,  if  x e Z(pQ) 
then  x € Xft  and  0 c f(pQ,x)  + T(x);  therefore 

Lf  ,JX)  ~ f(Pn’x)  6 Lf  rvi(x)  + T(x) 

Trfxj  0 tt(x) 

so  that  x e F (x).  As  x e Xc,  we  have  from  (2.4)  with  p = p„  that 
PQ  6 0 

d[  x,  XQ ] < d[  Fp  (x),  XQ]  < X HLfff(x)(x)  - f(PQ,  *)  II  • 

But  from  (2.  3)  with  p = pQ,  we  find  that 

^LfTr(x)^  " f<P0’x)li  = ^6^lx  ~ W(X)H  = P(6)d[x,  XQ)  . 

Thus 


d[x,X0|  < \(3{6)d[x,X0]  < “■  d[  x,  XQ  ] , 


implying  that  x e XQ  since  XQ  is  closed.  Thus  we  actually  have  Z(pQ)  = XQ. 

Now  take  any  e > 0;  find  f>  e (0,  6 ] such  that  for  a e [ 0,  6 ] 

£ £ 

one  has  \p(a)  < e/(\  + e ).  One  can  show  that  the  function 
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It  will  be  of  interest  to  establish  some  simple  methods  for 
constructing  polyhedral  multifunctions  from  other  such  functions.  To 
begin  with,  since  polyhodrality  was  defined  in  terms  of  graphs,  it  is 
clear  that  the  inverse  of  any  polyhedral  multifunction  is  itself  polyhedral. 

It  is  also  easy  to  show  that  if  F is  polyhedral  and  \ t 1R,  then  \F 
is  polyhedral.  The  next  result  shows  that  the  sum  of  two  polyhedral 
multifunctions  is  also  polyhedral. 

PROPOSITION  2:  Let  F and  H be  polyhedral  multifunctions  from 
lRn  into  lRm . Then  F + H is  polyhedral. 

PROOF:  If  dom  F 0 dom  H = 0 then  F + H is  empty,  hence 
certainly  polyhedral.  Otherwise,  let  G , ...,G  be  the  components  of  T 
and  let  ...,Xf  be  those  of  H.  Let  P:=  {(i,  j)  l"x(G.)  D tt^K.)  * 0 }; 
for  (i,  jf  - p let 

G.  A K :=  {(x,  y + z)  |(x,  y)  e G.,  (x,  z)  € K. } . 

We  note  that 

G,  n,  = C[  (G.  X K. ) fl  D(IRn  X ]Rm  X IRm)  ] , 

1 j 1 j 1 ’ 

where  C and  D are  linear  transformations  defined  by 

C : IRn  x lRm  x lRn  x ]Rm  -»  IRn  x ]Rm;  C(x,  y,  v,  w)  :=  (x,  y + w) 

and 

D : IRn  X JRm  x lRm  lRn  x Fm  x IRn  x IR™;  D(x,  y,  w)  (x,  y,  x,  w)  . 
Hence  G.  K is  a nonempty  polyhedral  convex  set.  Now  if  (x,v)  t graph 
(F  + H)  then  v = y + z with  (x,y)  f graph  F and  (x,  z)  € graph  H. 
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Tor  some  i and  j,  we  have  (x,  y)  e G.,  (x,  z)  K,  and  d.i)  y> 

i ) 

so  that  (x,  y t z)  c G.  i K.  and  hence 

graph ( F + H)  C U G.  A K . 

(>,))•  P 1 1 

On  the  other  hand,  if  (x,v)  belongs  to  the  set  on  the  right  then  for 

some  (i,  j)  - P,  (x,  v)  e G]  * K . Hence  v - y 4 z with  fx,y)  G graph  r 

and  (x,  z)  r IT  graph  H,  so  we  have  (x,  v)  e graph!  F + H).  Thus 

graph(F  + H)  = U G.  A K . , 

(i,j)fP  1 1 

and  therefore  F + H is  polyhedral,  as  was  to  be  shown. 

We  can  now  establish  Theorem  <1,  using  these  results. 

PROOF  OF  THEOREM  2:  We  shall  prove  that  the  hypotheses  of 
Theorem  1 are  satisfied;  indeed,  those  listed  under  (ii)  and  (iii)  are 
certainly  satisfied  under  the  assumptions  of  Theorem  2.  To  prove  (i),  it 
suffices  to  show  that  the  multifunction  [ A(  • ) + a + T(  • ) ] _1  is  locally 
upper  Lipschitzian  at  the  origin.  It  is  evident  that  A(  • ) 4 a is  polyhedral, 
whereas  T is  polyhedral  by  assumption;  hence  by  Proposition  2 
A(-)  4 a 4 T{  • ) is  polyhedral,  and  thus  so  is  its  inverse.  The  upper 
Lipschitzian  property  now  follows  from  Proposition  1,  and  this  proves  Theorem  2. 

We  conclude  this  section  by  proving  our  earlier  assertion  that  if  f 
is  a polyhedral  convex  function  which  never  takes  the  value  - then 
df  is  a polyhedral  multifunction.  The  proof  employs  two  preliminary 
lemmas,  in  which  for  y,  z v.  D<n  we  use  the  notation  [ y,  z | to  mean 
f(I  - \)y  4 \z|o  < \ < 1}. 
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LEMMA  3:  H C if  nonempty,  closed  and  convex  in  IRn  with 


y,  z • C,  then  for  each  x,  ri[y,  z|,  9^(x)  C d^(y).  Consequently. 


• ) is  constant  on  the  relative  interior  of  any  convex  subset  of  C. 


PROOF:  As  xe  ri[y,  z]  we  can  write  x = (1  - \)y , + \z,  with 


X • (0,1).  Suppose  x e 9kJj„(x)  (which  is  nonempty  since  x-  C); 

V 


let  c be  any  point  in  C.  We  have  (1  - \)c  + Xz  C;  thus 


(x  ,[(1-X)c  + Xz)  - x)  < 0 . 


-1, 


But  [ (1  - X)c  + Xz ) - x = (1  — X)[c  — (1  — \ ) (x-Xz)]=(l-X)(c-y); 


s}c 

consequently,  <x  ,c  - yN  < 0 so  x e 9^(y).  Thus  90^.(x)  9v^Jy). 


To  prove  the  second  assertion,  let  F be  a convex  subset  of  C.  If  F 
is  empty  or  a singleton  the  result  is  immediate;  therefore  suppose  that  F 
(and  hence  ri  F)  contains  more  than  one  point:  let  x,  y t ri  F with 
x * y.  By  | 16,  Th.  6.1),  there  exist  x,y  e F and  X,  p •"  (0, 1)  such  that 

x = (1  - x)x  + Xy,  y = (1  - p)y  + rx  . 

But  then  xt  ri[x,  y),  so  9i(iJx)  c 9v^(y),  and  y ri[y,  x],  so 
9vi^(y)  9i|^.(x).  Thus  9i^_,(x)  = 9a^(y),  and  so  94>^(  • ) is  constant  on 

ri  F,  which  completes  the  proof. 


We  can  now  prove  that  94j  is  polyhedral  for  polyhedral  C. 


LEMMA  4:  Let  C be  a nonempty  polyhedral  convex  set  in  IR  . 


Then  9o  is  a polyhedral  multifunction. 


PROOF:  By  | 16,  Th.  19.1),  C has  finitely  many  faces  F , ...,F  , 


and  by  f 16,  Th.  18.  Z | the  collection  (ri  F^  is  a partition  of  C.  By 
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Lemma  3,  on  each  set  ri  F.  the  function  9vjj„(  • ) has  a constant  value, 

1 N-/ 

a set  which  we  will  call  P. . By  [ 16,  Th.  <!3.10|,  P is  a nonempty 
polyhedral  convex  set.  Thus, 


graph 


m 

[ (ri  F ) x P. ) . 

1 = 1 


Now  consider  the  set 

m 

I : = U (F  x P.)  . 

1 = 1 1 1 

& 

Clearly  graph  r.  On  the  other  hand,  suppose  that  (x,  x ) £ T; 

3je  # 

then  for  some  i,  x f F and  x £ P,.  If  x < ri  F.  then  (x,  x ) £ graph  dv|>_,. 

ill  u 

If  x 4 ri  F.,  then  there  is  some  z * x.  with  z a ri  F,  (since  ri  F.  * a). 

1 i l 

Using  [16,  Th.  6.1[  we  can  find  yt  F.  and  X.  t iO,l)  such  that 

z = (1  - \)y  + \x.  But  then  z £ ri[y,  x),  and  by  Lemma  3 we  have 

P^  = 9^(z)C  9ii^,(x).  However,  as  (ri  F is  a partition  of  C and 

as  x £ C (since  F.  C for  each  i),  there  exists  some  j such  that 

$ 

x £ ri  F..  But  then  9ij^(x)  = P,  and  we  have  P.  C P . As  x t P., 

) C'  j i j i' 

Of 

¥ 

we  have  (x,  x ) = (ri  F.)  x P.  c graph  Hence 

J J C 

m 

graph  9il  = I'  = U (F  x P.)  . 

C i = 1 1 1 

But  each  F.  is  a polyhedral  convex  set  (as  a face  of  C),  so  F.  x P^ 
is  polyhedral  and  9Vr.  is  a polyhedral  multifunction.  This  completes 


the  proof  of  Lemma  4. 


But  if  f(y)  = 4oo  then  the  last  inequality  holds  trivially,  and  so 

(x,  x ) e graph  3f.  This  establishes  (3.1). 

Now  if  f is  polyhedral  then  by  definition  e is  a polyhedral  convex 

set;  hence  by  Lemma  4 graph  is  the  union  of  finitely  many  polyhedral 

convex  sets,  say  F^,  . . . , P^.  Denote  by  F the  set  lRn  x ]R  x ]Rn  x f-1), 

also  obviously  polyhedral  convex.  Using  (3.1),  we  have 

m 

graph  3f  = D[(iJ  P.)  0 F] 
i = l 1 

m 

= D[  U (P  0 F)] 

1 = 1 ’ 

m 

= u [ D(P.  n F)]  , 
i = 1 1 

and  since  D(P.  0 F)  is  polyhedral  for  each  i,  we  see  that  3f  is  a 
polyhedral  multifunction.  This  proves  Proposition  3. 

4 . An  application:  Newton's  method  for  variational  inequalities . 

We  shall  analyze  here  an  algorithm  for  solving  variational  inequalities 

over  polyhedral  convex  sets.  To  specify  precisely  the  form  of  the  variational 

inequalities  with  which  we  shall  deal,  we  take  x„  to  be  a point  in  IRn, 

* 

U a neighborhood  of  x,,  and  f a (Frechet)  differentiable  function  from 
U into  IRn  m.  We  let  A be  a linear  transformation  from  IRm  into 
lRn*m,  and  take  a to  be  a point  in  lRn  + m and  C a nonempty  polyhedral 
convex  set  in  lRn*m.  We  shall  consider  the  variational  inequality 
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14.1) 


0 t f(x)  + Ay  + a + th*>Jx,  y)  . 

Of  course,  we  could  have  m = 0,  so  that  A does  not  appear  (we 
could,  alternatively,  have  n - 0,  but  then  the  algorithm  we  will  discuss 
is  of  no  interest  as  the  problem  is  already  linear).  To  solve  (*4.1),  we 
propose  to  use  an  algorithm  which  generalizes  the  well-known  Newton 
method  for  solving  systems  of  nonlinear  equations  [11].  In  that  method, 
one  proceeds  by  selecting  a trial  solution,  linearizing  the  nonlinear 
function  whose  zero  one  is  trying  to  find,  then  solving  the  resulting 
system.  Thus,  one  retains  the  form  of  the  system  (equations)  but  substitutes 
the  linearized  function  for  the  original  nonlinear  one. 

The  algorithm  we  propose  is  based  upon  the  point  of  view  that 
variational  inequalities  generalize  equations  (which  correspond  to  the 
case  in  which  C is  the  whole  space).  Thus,  in  (4.1)  we  replace  f(x) 
by  its  linearization 

Lfx  (x)  :=  f(xk)  + f'(xk)(x  - xk) 
k 

about  a point  xk  € U,  and  try  to  solve  the  linear  variational  inequality 

0 e Lf  (x)  + Ay  + a + a^Jx,  y)  , (4.2) 

Xk  C 

thereby  retaining  the  original  form  of  the  problem  as  a variational  inequality 
but  replacing  the  nonlinear  function  f by  its  linearization  about  xk> 

This  prescription  raises  obvious  questions,  such  as  whether  the 
sub-problems  (4.2)  will  even  be  solvable  and,  if  they  are,  whether  the 
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solutions  will  converge,  to  what,  and  how  quickly.  We  shall  prove 
here  that  if  the  initial  point  x^  is  chosen  close  enough  to  a solution 
of  H-l)  having  certain  desirable  properties,  and  if  the  succeeding  points 
are  chosen  :rom  a certain  neighborhood,  then  the  sub-problems  (4.1) 
will  be  solvable  and  the  sequence  of  points  will  converge  quadratically 
to  the  solution  set  of  (4.1).  The  result  is  stated  precisely  in  the  next 
theorem,  in  which  we  use  the  notation 

S(  w)  :=  {(x,  y)  6 IRm+n  I 0 e Lfwix)  + Ay  + a + a^x,  y) } . 

THEOREM  3:  x..:,  U,  C,  f,  A and  a be  as  defined  above. 

Suppose  that  f1  is  Lipschitzian  with  modulus  L on  U,  that  [ f'(x ,)  A] 
is  positive  semidefinite,  and  that  for  some  nonempty  compact  convex  set 
**c  JRm,  Si  x ,.)  - (x,}xv. 

Then  for  anv  e > 0 there  exist  6 > 0 and  p > 0 such  tflaliar 
each  x € x^  + pB  one  has,  with  S.  ; S(xJ  + f>B, 

h * S(x)  n S,  C S(x  J + — (X  + e)L  ||  x - x^ll  ^ B , 

where  \ is  a local  upper  Lipschitz  modulus  for  the  polyhedral  multifunction 

Mlz)  {(x,  y)  Iz  e Lf  (x)  + Ay  + a + 9ik-,(x,  y) } . 

x* 

We  remark  that  \ must  exist  (by  Proposition  1). 

PROOF:  We  shall  apply  Theorem  2 with  P :=  U,  N : = IR  x 1R  , 

po  x*’  xo  ;=  * Y*  = S(X«)’  T :=  and 

f(p,  (x,  y))  :=  Lfp(x)  + Ay  + a . 


-24- 


V 


! 


We  note  that  if  y - V then  f7(p  , (x:,,  y ))  - [f'(x  .)  A|,  which  is 
positive  semidefinite  by  hypothesis;  also 

UpQ,(x  , y ))  + f2(pQ,  (x^.y^))!  (x,  y)  - (x..:,  y J ] 

= [ f'Cx^)  A | (x,  y)  + [ f(xj  - f'fxjx*  + a)  , 

so  that  the  constancy  assumption  t b)  of  Theorem  Z holds,  whereas 
assumption  (a)  holds  by  hypothesis.  Applying  Theorem  Z,  we  conclude 
that  there  exist  6 > 0 and  p > 0 such  that  for  any  x'  £ x„  + pB,  the  set 

£(x')  {(x,  y)  c S(x  ) + 6B  |0  £ Lf  ,( x)  + Ay  + a + (x,  y) } 

is  nonempty  and  is  contained  in  S(xJ  I-  (\  + e)ao(x')B.  However, 

a lx')  = max{  !l ( Lf  ((x)  + Ay  + a)  - (Lf  (x)  + Ay  + a)  !l  |(x,  y)  £ X } 

u X X ;t;  u 

= HLfx,(x*)  - Lfx  (x#)|l  = l|f(x#)  - f(x')  - f'(x')(x#  - x')ll  , 


and  a standard  bounding  technique  using  the  Lipschitz  continuity  of  f' 
shows  that  this  quantity  is  bounded  above  by  ~ L || x ' - x , ||  ^ . Thus 

C.  ¥ 

0*  S(x')  n S6  S( x sJ  + ~ (\  + e)L || x*  - x^ll^B  , 

which  completes  the  proof  of  Theorem  3. 

Theorem  3 shows  that  if  we  choose  x with  j! x — x ||  < p and 

a :=  — (\  + e)L.tx  - x II  <1,  then  select  (x  , y ) from  S(x  ) H S , we  have 

c 0 II  0 o 

d[  (x^  yA> , S(x  )]  < j (\  + e)L II xQ  - xjl^  < <x  ij xQ  - xj|  < p . 

If  L = 0 or  x^  = x we  are  finished;  otherwise,  we  have 

j (\  + eJLilXj  - x.J  < l j (X.  + e)L||x0  - xjl  )*  = nZ  < a , 


dm _ ' . 


so  that  Xj  satisfies  the  same  assumptions  as  did  xQ.  Accordingly 

S(x  ) is  nonempty,  and  selecting  (x  , y ) from  S(x  ) H S.  for 
A K'l  K + 1 K 0 

k - 1,  2,  . . . we  find  by  an  easy  induction  that  the  sequence  {(x  , y 

K.  7 1 K 4 1 


exists  and  satisfies 


.k+1. 


d[(Xk+l’yk+l),S(X*))  = 2 + e)L^\  " x*  ^ < 2(k  + e)'1!/1./2  } . 

Hence  the  sequence  <d[  (x^,  y^),  S(xj,.)]  } converges  to  zero,  and  the 
convergence  is  R-quadratic  in  the  sense  of  [11]. 

The  requirement  that  (xk)|.yk+1)  be  chosen  from  S,  maybe 
difficult  or  impossible  to  implement  computationally.  It  can,  however, 
be  avoided  by  placing  more  assumptions  on  the  functions  appearing  in 
the  problem.  For  example,  if  f and  A are  such  that  one  knows  that 
the  solution  set  of  each  sub-problem  (4.2)  is  connected,  then  S.  can 
be  eliminated  since,  in  Theorem  1,  the  upper  semicontinuity  of  lb  and 
the  fact  that  ^(PQ)  = Xq  imply  that  for  p near  pQ  the  solution  set 
of  0 g f(p.x)  + dvj,c(x)  either  lies  entirely  in  X&  or  is  disconnected. 

At  this  point,  the  algorithm  we  have  presented  has  to  be  regarded 
as  an  exercise  in  constructive  real  analysis  rather  than  as  an  effective 
computational  device,  since  large-scale  testing  of  its  effectiveness  on 
real  problems  has  not  yet  been  carried  out.  However,  in  one  special  case 
some  experience  is  available:  for  the  nonlinear  programming  problem  (1.  5) 
the  method  proposed  here  reduces  to  that  of  Wilson  [20],  for  which  some 
computational  results  were  reported  in  [ 1|.  These  were  comparatively 


unpromising,  perhaps  because  Wilson's  code  used  an  early  simplex 
algorithm  for  solving  the  quadratic  programming  problems  to  which  the 
sub-problems  (4.2)  reduce  in  the  case  he  considered.  Later  computational 
investigation  of  a method  similar  to  Wilson's  produced  much  more  promising 
solution  times  [12],  and  analytic  investigation  proved  the  convergence 
to  be  quadratic  under  certain  regularity  assumptions  [13]. 

Actually,  Wilson  proposed  in  a manuscript  [21]  a method  for  solving 
the  following  problem,  which  extends  the  usual  nonlinear  complementarity 
problem; 

H(x,  y)  = 0 
x,  y > 0 
< x,  y>  = 0 

(to  recover  the  nonlinear  complementarity  problem,  take  H(x,  y)  :=  F(x)  - y). 
His  idea  was  to  partition  the  variables  into  basic  and  nonbasic  sets, 
then  solve  a system  of  linear  equations  obtained  by  linearizing  H with 
respect  to  the  basic  variables  while  holding  the  nonbasic  variables 
constant  at  zero.  In  case  this  procedure  resulted  in  negative  basic  vari- 
ables, he  proposed  to  use  a pivoting  strategy  to  eliminate  the  negative 
variable  from  the  basis.  His  procedure  seems  very  close  to  what  would 
be  done  by  the  algorithm  proposed  above  when  applied  to  a nonlinear 
complementarity  problem;  however,  the  author  is  unaware  of  any  formal 
analysis  of  the  convergence  and  rate  of  convergence  of  Wilson's  proposed 
method. 


1 molly,  Kuhn  [ <>]  reported  work  of  Saigal  (privately  communicated) 
on  the  use  of  vector  labelings  in  pivotal  algorithms  for  computing  fixed 
points;  according  to  Kuhn,  Saigal  had  shown  how  the  use  of  such 
techniques  could  lead  to  quadratic  approximations  in  simplicial-subdivision 
techniques.  This  work  may  be  related  to  the  algorithm  we  present  here. 

A i Je note ■ The  work  of  Saigal,  mentioned  above,  appears  in  a 
manuscript  entitled,  "On  the  convergence  rate  of  algorithms  for  solving 
equations  that  are  based  on  methods  of  complementary  pivoting,  " a copy 
of  which  was  kindly  furnished  to  the  author  by  Dr.  Saigal.  It  deals  with 
the  problem  of  finding  a fixed  point  of  a Lipschitz  continuously  differentiable 
function  f : ff<n  - IRn.  Thus,  it  appears  to  be  directly  related  to  the 
work  presented  here  only  for  the  case  in  which  T(x)  is  identically  zero: 
i.e.,  the  case  of  conventional  systems  of  nonlinear  equations,  for  which 
our  algorithm  reduces  to  the  standard  Newton  method. 
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